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ABSTRACT 

The introduction to this selective bibliography on 
the use of computers in teaching statistics provides a briaf reviefcf 
of the role of the computer in statistics; the role of statistical 
packages; how statistical packages should be used in instructioa; 
science, statistics, and data analysis; and choice and evaluation of 
statistical packages. Items listed in the bibliography are concerned 
both with instructional applications of computers and with the 
evaluation and comparison of statistical softv/are. A copy of a 
questionnaire seeking information on available statistical software 
is appended, together with selected responses* These questionnaires 
were used to compile the data reported in the Index of Publicly 
Available Statistical Software, by Kohm, Ryan^ and Velleman (19771. 
(LLS) 
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The use : t he c:^;?pj' rr fcr sta *-jr~t ical a?alysGS has 
z-^-tZ qrowinq : - le^ips ^ ?d . oands in ::ast decade. Sojne 

Z-HL-rect evidv. i of tn^s :. dpld qrcwt. are: th-^ increased 
"zr"3crtion of -rticles repcrt-inq compui^r bared results in 
srr ^ ■::ist ical ^ rnals; tz^ Idcqe p. :2iber c publicly 

=:i7aij.able st _sticai .-'^ftware and pc.r:::aqes (r ported by 
Ryan^ a:. a "^^ellenar (1577), €.q.) ::^ny of v ich becarne 
iLvdilable onl' within ris . :^st five year: the ci -^.tion of a 
5e~ion on Statistical :cinc.ti:iq i^i* in th h^-^ :ican 
Sta tistical -iHisc-ia tir n ; th i-itiati-j of Us::..z Groups, 
Newsletters^ oerial B?=t::^3, e ssoc-..: '-^^ ' yit:h 

particular sr^ : sz-^zal packa res suci: S AS and 3?:. 3, es 

well as incr- : 'u7L'£.^3 f ziicz^ CQiixseSr ^^ric^hops, 

tutorials^ an-i :r:- il:-:e ^c: vVcriouj pd-rri^ig-^s ; and dh- 

The use of . ne : ^ jte l an: sta tlst^.ca 1 pace 7 es for 
statistical _nstructi..n • r-.'ie ccher hand, has been .'roviaq 

at a much slowe^* P'icie,, ir. corDar--Son -ith oth^^ usaqe, 
whether measured a r - — : iy :n czali'^j. Sys ic and 

regular use of t rr-^rer ir "-ea-hiag c^atist-.- far 

from being univer:3::. , ^v.ci. itay st:. 11 he :'C^sidere- ... :e the 
exception rather ^.jfi t::.'e ru-, ^: all ._e-^'^^s of 

instruction. T::.er:i z:z-^ aaTi^ rea-scns f:r ^he a pp:-:-^- *: slow 
growth in c om put=^ r-- ni^steu:. iTStrructron i3tati3ti_:ii: . Two 
of the mai- ones ar: . (1) ±ir:y instru^trrs of sr:r:" rtics^ 
who are knowledqec.^ le in 1t3 S"*r.::ject anc its tr^ — :::ional 
methods of i^nstrucrr ^re x^fa-x . lia r v^^th , or ill-a.— ease 
in, the use of th^ coT.ruter iDi general <■ r: the pa— i i^i-^ars 
in the use of statistical :;^i<ages=) cr :le: effective U3e of 
such facilities zo lae Lnstromian of stET:isr:^al 

topics* Consequenti-^., t^ti material ard lecture nctes :hat 
are independent cf cosfuter and rerrext advances iz its 

software are still v zz-:li ur^J. (2) >t those instructors 
who are Icnowledgeaole i; statistical computing, the 
following type cf c^-er^lae .t is cften expressed, "it is 
sufficiently difficult zo zo ^p^z the st^zri-stic materiel in 
a course without hor it^^; tc t^ach th^ students compt irig 
(such statements are jt:t«?u £cr ;^raniec ty laments about the 
students ' lack of tacy c 'ouud tr interest in ma the ma itcs, 
statistics, or computinq, tt ould be impossible to teach 
them the use of 5it at ist ::C(i ritx. ^.^qes (or the computer) and 
still have time laft to teac-: th^-.r. statistics*" 

Both the above reascuri can be iittributed to the same 
cause, to a large extent., ^^is cause might be termed 
"unfriendly compu teres^" . I^^ is guite apparent that 
dif f icult-^to-learn or unnartrrai languages associated with 
operating systems aad srHti . t ical packages have been 
deterrents to statistical ftoimp. t^ag for many instructors and 
students who otherwise wcuj^ have iraae. considerably more use 
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3f them. Fortunately, the situation is steadily improving: 
packaqe developers are qenerally paying tjcre attention to 
the userin ter f ace aspects of their products; user pianuals 
dre better vritten; and there are recently developed 
systems such as IDA and Kinitab which are particularly 
suitable for use in conjunction with statistical instruction 
::ecause of the simplicity of the languages. 

In this article^ the author provides the reader with an 
outline of basic inforaaticn^ as veil as sources and 
references for further iaf or raation, on topics directly 
related to the use of computers fcr statistical instruction. 
We believe such information will be useful to those who are 
contemplating the introduction of statistical packages as an 
aid to their teaching* , Some. of the information should be 
useful even to those whc are already using the computer in 
their statistical instruction because they may not be aware 
of much of the existing rescurses and literature references 
assembled here. 



2a THE ROLE OF TH2 CCMPUTER IN STATISTICS 



The computer is playing a sajcr role today in various 
aspects of research, application, and instruction in 
statistics* In view of the treiaendous power of existing 
computers and the availability of high quality software, 
statisticians not only caa perform traditional computing 
tasks {associated with well established methodology) with 
utmost ease accompanied by a high degree of numerical 
precision, but can also approach old problems from _ new 
points of view much of yhich would not have been possible 
without the computer to perform the otherwise ejrtremely 
tedious comput at iop.> < -^ong the major areas of statistical 
a pplica tions in wh ;:h . ^t--^ computer is an in disp ens ibl 6 tool 
-are: 

A. — Monte Ca ..v fr es and Simulation Experiments, 
— Routine Statistical Analysis of Large Datasets, 

C. — New Methodologies Involving Large Amounts of 

Computing, e.g.. Robust Methods, Iterative Methods, 
a nd S ub sa m p li n g e t ho d s , 

D. — Exploratory Data Analysis and Model Building, 

E. — Validation of Statistical Assumptions via Residuals 

Analyses and Graphical Methods. 
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3. THE HOLE CF STATISTICAL PACKAGES 



Current usage of statistical packages, whether measur-i 
by the nuaber of people €nqaq€d in those activities or zz^ 
aaount of computer time used ^ prcrabLy falls within t :e 
follow inq cat eqor ies listed in d6cr-3c.sinq crde r of usacre : 

A. — Descriptive ^nd Tabular Ana-^vsis. TooLs for use's 
in their pre.paraticn of of reports involvig tabu._, r 
and qnaphical displays cf data or samaries f 
da ta • 

— Tools' for statisticians to facili.tate t: 
performance of tasks in 2* A-E. 

€• — Aids to teachers in the instruction of statistics. 



4. HOW STATISTICAL PACKAGES SHOULD BE USED IN INSTEOCTICN 



We first propose some gene ral quidel ine s on the use of 
statistical packages in instruction, independent of how they 
are used to convey specific statistical concepts or to 
illustrate specific methodologies* They are as follows: 



A- — Packaqes should siiiply be usad as a tcol in the 
proper analysis of statistical data. A proper 
analysis must net te constrained by what a 
particular package can or cannot do- Often an 
analysis may require the use of several different 
packages because of the limitations of capabilities 
in the "standard" packages. Sometimes, none of the 
existing packages will deliver 'ifhat needs to be 
done on a particular prcbleo, in which case special 
programs or routines must be written to perform the 
necessary tasks. 

B. — The roles of statistics, statistical packages, and 
their interface should be clearly established. It 
is the dury of the instructor to convey tc the 
students the prc per cc Ecepts of statistics and the 
£E2£§,L a polica tion of statist ical met hodology , 
regardless of what statistical packages are 
available, what procedures are avail able w ir.hin a 
particular package, cr what sorts of outputs are 
available within a particular procedure of a 
part icular package. 

Sta tistical packages (or the instru ct icz. of 
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citec Th 3se articles deal with the actual us ^ of the 

compijecz or statistical packages at various _ = vels of 
instrctziion. on a large number of topics* Th% list of 
artic_^^ is only intended to serve as a representative 
sampl- recent work on computer-based methods or material 
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moderately small. While th€ ccnputer and statistical 
packages a_:e used to facilitate computational tasks 
associauior with the execution of statistical methodology, 
one important aspect of statistics its unQ::arlying 

philosophy — and the impact of the computer on the 
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realization of the phi: -.-cphy^ is ofte: overlooked i: 
instruction^ esracially . ;t elementary staTistical courses 
We shall briefly'' liscuss ^'h^t philosophy \nc approach in th 
next section^ 



:c ..:nce^ sta 



AND DAI. ANALYSIS 



Trad 
met hods c ' 
sta tistic 
dat a. 
clea r~c u 
that bea: 
to encci:-^ 



:icr..a_ te xtbook:.: o 
miction oftiH^n 
a^ .: of proL^en. 
::G:: proHeiDS 
l^-def ined e ; ^ 
_ie reseiDblanc . 



statistics and convent io na- 
1 short c:: the real goals o: 
Ivinq and analysis of rea. 
zrten pre sen "ed in the f orm o: 
r._ BBS in con f iriaat ory analysis 
z: the problems one is likel; 



.n the Seal iar^ d. 



St ^^^-.£• 



scie nee 
one-pas^ 
displa Y 
transfo 
carry 
arr ivi:. 
compute 
the reai 
philosc 
excerpt : 



.ni; 



:al analysis ii 
l7Zi nq sta tisti 
r. In addit™ 
-s data for 
-ecesssry ^ 
analysis th 
one tentative 

-struction qre- 
and proper a 

:d a pproach i 



a delicate blend cf art and 
r:al data is seldora^ if ever^ a 
zr. tc routine examination and 

^-enical cccuracy, edit and 
statistician usually has to 

-h several iterations before 

:nclu£icns . The use of the 
:.y facilitates the teaching of 
reach tc statistics. Such a 

articulated in the follovring 



"L 1 -a analvsis^ and zhe parts of statistics which 
adhere c it, must then take on the characteristics of a 
science rather than those cf mathematics, specif icallv: 

(1) Data analysis must seek for scope and 
usef iness rather than security. 

(2) Data analysis must be willing to err 
moderately often in order that inadequate 
evidence shall more often suggest the right 
answer. 

(3) Data analysis must use mathematical argument 
and mathematical results as bases for 
judgment rather than as basis for proof or 
stamps or validity." (p. 6) 

"If data analysis is to be well done? much of it must 
be a matter of judgment, and 'thecryVr whether statistical 
or non-statistical^ will have tc guide, net command (p. 10) 
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wronq question^ which can always be made precise. (p. 13) 

JOHN TOKEI (1963) "The future of Data Analys-^^' in 
Annals of H athegat ical St at i s tics. 



What Tukey said fifteen years ago reirains t-^iely today 
because exploratory data analysis entails consid^^r ibly mors 
computation, through trials and refinements, th;:- its one- 
pass conf irma tary analysis counterpart. Such an ar oroach is 
possiblSr but iiapr act ical, without the s: :crt of 
appropriate ccsputing software, Eox(1976) re^ -.a^ed the 
same theme in a recent article in which he said 



"In the inferential stage, the analyst acts as a 
sponsor of the model. Ccnditicnal on the a§sun :ti on of its 
truth he selects the best statistical procedure for analysis 
of the data- Having ccmpleted the analysis, acwever, he 
must switch his role from sponsor to critic. Conditional 
now on the contrary assuirpticn that the ir del may be 
seriously faulty in one or more suspected or unsuspected 
ways he applies appropriate diagnostic checks involving 
various kinds of residual analysis," (p, 793) 

"The symptoms of <coc kbcokery) are a tendency to force 
all problems into the acids of one or two routine 
techniques, insufficient thought being given to the real 
objectives of the investiqaticr cr to the relevance of the 
assumptions implied by the imposed methods." (?. "57) 

"Mat heraat ist ry is characterized by development ^>'^: 
theory for theory's sake, which since it seldoiu toucies dr^- 
with practice, has a tendency to redefine the problem rather 
than solve it." (p. 797) 

GEORGE EOX (1 976) "Science and Statistics" in 
Journal of the Ame rica n Sta tist ica 1 Association, 



The iterative process of being a sponsor and a critic 
of various tentative models is most effectively carried out 
with the aid of appropriate interactive statistical packages 
cr lanquages. The reason is that although the same steps of 
analysis can be carried out using packages run in the batch- 
mode, the long wai ts in turnaround between jo b submission 
and completion generally inhabit (at least deter) the 
statistician or the student from trying out a large number 
of small changes in the analysis or exploring as many 
alternative modes of analysis as they would under an 
interactive enviroment , 

The interactive system IDA was created at a tine and 
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eaviroment when : : 6 hours turnaround for batch jobs was 
not uncommon^ ir ziql that exploratory analyses could te 
carried out in c . vrticu with a course in data analysis 
without the f rusr-~- --i^ons of long delays associated with 
batch computing^ „.rcr its creation in 1972, it has not 
cnly been used :::^:r'rily and effectively as an aid to 
statistical instr i,t:::c n , bat has also been a useful tool in 
aany proiects of serious research and real problem solving, 
Linq and Roberta (1975) describes the interface between 
interactive compu^iinq and data analysis as follows: 



••Interactive computing articulates extremely well with 
the requirements ox enlightened statistical analysis, in 
which the analyst examines data to help foraulate an 
appropriate stat-istical acdel, applies diagnostic checks to 
criticize the ffixsel, revises the model as necessary, and 
continues the process until he is satisfied with that he has 
done about as ^.11 as he can within the given constraints of 
time and space.. Only then dees the final statistical 
analysis ensue^ and that is relatively cut and dried." (p. 
411) 

"UnliJce x^any textbook descriptions of statistical 
analysis, and also unlike many statistical packages (even in 
timesharing), IDA does net presune that an analysis need be 
done in a fixed sequence of operations. In order to exploit 
the capahabilities of user-sachine iteration, IDA breaks 
down statistical operaticns into relatively small modules 
that can be executed in any sequence that is logically 
possible. This permits the user to take advantage of what 
he has learned at each stage before deciding what to do 
next. Except for certain obvicus restrictions in the order 
of execution of commands (for example, the user must first 
enter his data before he can edit, display, or operate on 
it) , the user has complete freedom in going from any one 
command to any other. Such freedom is particularly 

important and useful in exploratory data analysis for 
regression, where, in the process of deciding on a model 
that does justice to what seems to be happening in the data, 
the user generally needs to exauine residuals and fitted 
values for tentative models, try transformations, possibly 
delete observations to cttain insight into the effect of 
outliers, and examine various plots an ^ numerical model- 
adequacy checks, not in any specific order but in an order 
that appears appropriate for the problem at hand, depending 
on the feedback and interpreta tier of intermediate results. 
Such a mode of analysis, though possible under other 
systems ^ is gene rally m uch more cumber so me to execute than 
in IDA." (p. 415) , 

ROBERT LING AND HARRY BOEERTS (1975) "ICA: An Approach 
to Interactive Data Analysis in Teaching and Research" in 

jjojiiaai fil Eusiness . 
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6. CHOICE AND EVALUATION CF STATISTICAL PACKAGES 



A useful and up-tc-date reference cn the publicly 
available statistical software packages is the iL.dex of 
5ta tistica 1 Soft ware , edited by Fobert Kohm, Thooas A* 

Syan^ Jr., and Paul Velleman; available in microfiche 

form in the P roceedi ngs of the Statistical Comp uting 
Section, Ame rican Statist ica l Assoc i a tica (197 7) . It 
consists of a conpilaticn cf the responses by 56 package 
developers reqardiiiq the capability, availability, 
portability, cost, and other relevant information about each 
package. In addit ioa, abstracts cf the packages (vri tten b y 
their developers) are provided, 

Kolm, Ryan, and Velleman describe the purpose and 
content of the Index as follows: 

"The purpose of the index is to provide a single 
reference capable of answering many cf the first questions 
that may arise when an individual is trying to determine 
which computer program might best serve his needs. To serve 
this goal, the index consists primarily of three parts. 
Part 1 is a listing of the general capabilities of the 
indexed programs. The goal of this portion of the index is 
to cross reference programs and capafciJ.it ies, e.g., which 
programs do ANOVA, Sisple Data Descriptions, Factor 
Analysis, etc. The second part of the index consists cf 
ffliscellaneoas details about each program. This portion of 
the index is designed to aid the reader in determining 
whether the programs of interest tc a particular user will 
run cn his machine, in what computer lanquaqe it is written, 
in what form is the proqrao available, etc. The third 
portion of the index consists of the names and addresses cf 
the developer, distributer cf the program, distributor of 
the documentation, and the person who completed the 
questionnaire reply. It alsc ccntains an abstract of the 
computer proqram written by the persoa who completed the 
questionnaire, typically the developer. Virtually no 
quidelines were given to the developer in the writing of the 
abstract, so they are quite different in appearance. The 
abstracts provide the developers with a form in which they 
may describe special features cf their program that are not 
adequately covered by the questionnaire. Hany developers 
have chosen to describe their strcnq pcints and those 
features which they feel make their programs unique." 

The author believes the Index will serve its goals well 
and that the Index (and its future editions) will prove to 
be an indispensible source cf information about the 
available statistical software. The questionnaire which 
qenerated the Index is reproduced in Appendix A, and the 
tabulation of the responses is reproduced in Appendix B. 
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The Index makes no attempt to evaluate any of the 
indexed packages. There are nary recent articles which deal 
with the evaluation and comparison of packages or particular 
routines within packages* A list cf such articles is given 
in subsection B of the Reference section. 

In choosing one cr more statistical packages for 
instructional purposes, care should be given to the "ease of 
use", *'handiness", or "friend liness" of the packages. A 
package which ha s a complicated control langu age or data 
structure will necessitate students spending much time 
learning to use the package itself, thus leaving much less 
time learning about statistics. Existing packages leave 
much to be desired in their user-interface implementations. 
"Unfriendly" systems and packages still abound, and they 
should be avoided as instructional aid in statistics, 
especia lly in courses ta ugh t at an eleaen t ary level. M any 
universities find it necessary to offer special 1 -credit 
hour courses just to teach students the control language of 
one statistical package, such as SkS (which is one of the 
more "friendly" packages). The fear and frustration 
students may experience in usinq less " f rie nd ly" packages 
can be extrapolated from that. 



7. OTHEE SCOBCES OF INFORHATION 



Apart from journal articles, meetings, and other 
activities affiliated with scientific societies, there are 
numerous special publications ard special organisations that 
provide services and information that ai^e pertinent to the 
use of the computer in statistical instruction. Some of 
these are described below: 

A* Information on how computers are used for 
instructional purposes (not limited to statistical 
instr actiovi) by 57 educational institutions ranging in 
level from eXementary schools to major universities has been 
compiled by the Human Resources Resea rch Orga niza ticn, 
published as the Academic C ompu ting Directory (1978) : The 
Directorr not only identifies the "exemplary" institutions,, 
but provides information on the computers they have, how 
they are used, andthe reasons the instit uti on s were 
selected as "exemplar" of academic computing. In addition, 
the name and address of a contact person at each institution 
- one who will answer guestions from inguirers - is provided 
for each ontry. Copies of the Directory are available for 
$3.95 each from: 

Human Resources Research Organization 
300 N. WashingTon Street 
Alexandria, VA 2231U 
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B. EDONET is a national network of colleges and 
universities formed to prcaote the sharing cf computer-based 
resources in hiqher education. The operation of EDUNET is 
overseen by the Planning Council cr Computing in Education 
and Research^ which was formed in 1974. The Planning 
Council is one of several special activities of EDOCOM - a 
non-profit organization established to further cooperative 
efforts aiBonq institutions of hiqher learning. EDONET makes 
possible the connection of any ccBputer terminal to many 
computer centers across the country. Among the statistical 
proqr ams and packages accessible under EDONET (January^ 
1978) are; APL, BMDP, CATAEDIT, IHSL, HAUOYA, aiNITAB II, 
SPSS^ TSP (at Stanford);' BMDr DATATEXT, LIDAp TPI, TSP (at 
Yale) ; FCST2***, FCST3***, FCSTU***, ST ATNC7*** , 
STATPHOE***, STATIS*** (at Dartacuth); HINITAB, SIATJOB (at 
Wisconsin); SCSS (at Notre Dame); and many ether packages at 
other universities such as Cornell, MIT, Princeton, and 
Rice. For information about available resources and 
services, documentation, charges and accounting, and other 
EDUNET matters, one nay call the toll-free EDONET Hotline 
(800) 257-9505, or write to: 

EDUNET Central 
P. 0. Bo7 36U 

Princeton, NJ C3540 

and current information items about EDONET are 
: ■ to nembers (and tc ethers upon request) by the 
i.aic, ..iv-r. oas EDONET News and EEUCCJI Bullet in • 

C* The Symposium on the Interface of Computer Science 
and Statistics, held annually since 1967, consists of a 
series of workshops on topics involving statistical 
computing. Papers presented at the workshops appear in the 
proceedinqs of the Annual Sutapcsiua. Articles on the use of 
computers in statistical ii)Struction has appeared regularly 
in recent Proceedings, 

D, The Association for Educational Data systems (AEDS) 
is an organization which spcnscrs cumerous computer related 
activities in education on a variety of topics, ^ including, 
computer- ass is ted instruct ion, ccipputer- ma naged instr uction , 
computer-assisted guidance, computer-assisted testing, and 
so on. Although most cf its sponsored activities are not 
addressed specifically to statistical instruction, many of 
them do apply tc ccnputer assisted instruction in 
statistics. Its affiliated publications (and annual 
subscription rate) are: the AEDS Journal (520) , the AEDS 
Monitor ($12), and the AEDS Bulletin ($5), all published 
guarterly* AEDS membership dues are $25 fer year (student 
$10) which include subscriptions tc the publications listed 
above- 
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AVAILABLE STATISTICAL SOFTWARE QUESTIONNAIRE 



1-1 Cooplete nane for the packiage or program? 



K2 Short nane or abbreviation for the prograa? 



1.3 Principal developer: 

Name 

Organization^, 

Add re 98 



Telephone ( ) 

l.^l Distributor If different froo (1.3) above: 

Name ■ 

Organ l2atlon____ - 

De pa rtoen t_ 

Address ■ 



Telephone ( ) • — 

1.5 Distributor of documentation If different from (l.M): 

Name — 

Ori^anlzat Ion 

Deoar tment 

Address - . -- — • ■ 



1.6 Person r^^aponalble for preparation of this reply If different from 

Name ^ 

Organization ' — — 

Department . . 

Street ^ 

ty^ I State Zip 

Telephone ( ) — ~ — ■■ 

1.7 Date of laat program release and Identification (e.g. Version 5.3) 
Date Identification 

1.8 Expected date of next release. 
Date. 



Telephone ( 



) 




Does Itt^v progran run In batch node? 



Page 2 



(a) Yes 

(b) No • 

If your prcgrata Is Interactive, can It run as a (One or more answers) 

(a) Fully Interactive progran with prompting questions. 

(b) Fully Interactive program with answers Immediately 
available, but without prompting questions. 

(c) As a control card checker (I.e. 
essentially Interactive checking of control 
cards for large batch-llke analyses). 

Program availability. (One or luore replies) 

'a) Mot available for export to other computation centers? 

(b) Passively distributed to other centers? 

(c) actively distributed to other centers? 

(d) Available through time-sharing system? 

If your program 1« available for export, how is it available? 
(One or more answers) 

<a) Under a rental agreement. Coot__ 

( b ) For purcha se . Cost, , 

(c) Through some computer association such as SHARE, 
VIM, CUBE, DECUS, COMPyiT, etc. 

If your program is available for export, how is it transported? 

(a) In a form which is compatible with one computer brand only. 

(b) In some readily transportable language (e.g. ANSI Fortran). 

(c) In separate versions for different computer brands. 

If your program is available for export, approximately 
how many centers have received copies of your program? 



Which of the following documentations does your program have? 
(One or more answers.) 

(a) Primer intended for novice users or studenta with 
worked out exam pi es. 

(b) Reference raanual. 

(c) On line "HELP" selectively available "on request." 

(d) Implementation or systems programmers guide, 
(e ) Other . 
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Pago 3 

2.8 Please check all coapllers or interpretera required. 



(a) Fortran II 

(b) Fortran IV 

ANSI 

(c) Not verified 

' (d) Verified on Bell Laba PFORT verifier 

(c) Verified on ether verifier 

(f) Extended FORTRAN IV 

(g) COBOL 

(h) PL/1 

(i) Asseabler 

(J) BASIC 

(k) APL 

(1) Algol 60 

(m) Algol 68 

(n) Other (please specify) . ^- 

(o) Special coapiler (please apecify) . 

2.9 On which computer aakea has the program been 
successfully run. 

Manufacturer Model Operating system (if important) 



2.10 Is your program: 

(a) Under continuing development. 

(b) Being maintained only. 

(c) No longer being maintained. 

2.11 Are local program modifications permitted? 

(a) Yes 

(b) No 

2.12 Are local additions of capabilities permitted? 

(a) Yes 

(b) No 

2 13 Has your program or package (or a technique unique to 

vour program or package) ever been reviewed or deocrlbed in a widely 
distributed Journal or at a society meeting? 

(a) Yes (pleaae list these on a separate sheet of paper) 

(b) No 
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1 Rank your progra««3 strengths with respect to the following 

applications (1 for strongest, 2 for second strongest etc.). 
Do not rank Inappropriate Items. 

(a) The analysis of data. 

(b) Data ..manipulation, editing, table buUdln?. file handling. 

(c) Monte Carlo simulations 
(d) Teaching 

I (e) Other , 

.2 Which of the followlnn would best describe your program? 
( Check only one . > 

(a) A stand alone main program. 

(b) A collection of separate main programa. 

' (c) A special purpose atatlstlcal package or set of 
prograos (e.g. a linear models package). 

(d) An Integrated general purpose system of statistical 

routines running under a monitor. 
(e) A set of subroutines or similar groupings of code. 

.3 What is the principal Intended audience of your program 
In terms of statistical experience? (check only one) 

(a) Conplete novices with little or no statistical background 

; (b) Naive users with one Introductory survey course In 

atatlstlcs. 

(c) A user with moderate exposure to statistics coming 
from advanced courses In statistics or from periodic 

exposure to statistical methodology. 
(d) An advanced or sophisticated uaer possessing an 

extensive background In statistical method and theory. 

I 4 How would you describe your principal Intended audience 
In terms of computer experience? (check only one) 

(a) Novices with no previous exposure to data processing 



or computers . 

(b) Naive users who »ay have used the computer somewhat 

(In a course for example). 

(c) Users with moderate exposure to data processing 
techniques and familiarity with either some languages 

or with experience In using statistical packages. 

(d) Advanced or sophisticated users with programming 

experience and/or data processing expertise. 

3.5 For what general fields Is your package Intended? 
(One or more answers) 

(a) Biological sciences 
(b) The social sciences 

(c) The engineering and physical sciences 

(d) Business and economics 
^ (e) General statistics 

(f) Other(s) • 
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Por th© Questions ^,1 - ^•BO please circle the "ost appropriate anawer using the 
follovrlnff replies: 

C Capability The program or package hao sufficient 

capibllltles In this area to be considered 
as %i f ea ture . 

L Llnlted The prograo or package haa sooe capabilities 

in this area, but they should be considered 
as Halted. 

D Documented The feature can be easily accoapllshed 

using the docuraentation supplied with the 
prograa, but is not a standard (■built-in") 
option . 

F Feasible The feature can be acconplished by users 

with sone expertise in the area, 

I Insufficient The program or package has insufficient 

or "Blank" cjd pab 11 1 1 les in this area to be classified above. 

P Planned The program or package has either limited 

or no capability in this area» but program 
changes are planned to lnclu<}e this area 
in the "C" classification within one year. 

0 Obsolete The program developer considers the program 

or this aspect of the program to be obsolete. 

Programs may be listed in several categories, (For example, a 
nonparametric analysis of variance program would be listed 
both in Analysis of Variance and in Nonparametric Statistics.) 
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C L D F I P 0 ^1.1 Flic bulldlna ond manipulation. 

C L D F I P 0 h.2 Variable and catesory labeling. 

C L 0 F I P 0 1.3 Data tranaforiaatlons (standardization, logs, 

arlthnetlc , etc . ) 

C L D P T P 0 1.*> Hstrlx oowputptlon^ and manipulations. 

C L D F I P 0 ^.5 Sorting and matchlnf;. 



C L D F I P 0 4.6 Case selection/exclusion. 

C L D F I P 0 4.7 Consistency checklnc^ (simple and conpound logical checks) 

C L D P I P 0 4.B Automatic error correction (consistency correction, 

Imputation, etc.) 

Computations 

C L D F I P 0 4.9 Simple data descriptions and statistics (means, 

std.dev, histograoSr t-test, simple linear regression, 
one way AKOVA, blvarlate plotting, etc.) 

C L D P I P 0 4.10 Multiple linear regression. 

C L D F I P 0 4.11 Analy'ils of Variance (balanced data and 

traditional designs). 

C L D F I P 0 4.12 Analysis of Covariance (balanced data and 

traditional designs) . 

C L D F I P 0 4.13 Canonical correlation and principal component analynis. 

C L D F I P 0 4.14 Factor Analysis. 
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C L D F I P 0 1.15 Multivariate analysis of variance and 

discrininant analysis. 

C L D F I P 0 M.16 Non-linear regreasion. 

C L D F I P 0 *I.17 Cluster analysis. 

C L D F I P 0 *.18 Siaple analysis of nulti-way tables (percentages, neans, 

9td . dev . , etc . ) 

C L D F I P 0 «l.19 Uoglincar analysis of aultiway tables. 

C L D F I P 0 i;.20 Other analysis of categorical data. 

C L D F I P 0 1.21 Variance estloation for oooplex sanple designs. 

C L D F I P 0 1.22 Tine series analyses (spectrua analysis, 

Box-Jenkins, forecasting, etc.) 

C L D F I P 0 1.23 Data saoothing and curve fitting 

(exponential saoothinff, splines, etc.) 

C L D F I P 0 1.21 Bayesian atatiatics. 

C L D F I P 0 1.25 Monparaaetric statistics. 

C L D F I P 0 1.26 :BandOB nuaber generation and sinulation. 

C L D F I P 0 1.27 Table printing (aulti-way tables: foraatted, 

category labela, etc.) 

C L D F I P 0 1.28 Graphica (veraatile diaplaya of data and 

interaediate rcsulta on printer, teletype, 
or apeoial graphics hardware. Note: hiatograns 
and plots in which uaera have no control over 
the choice of variablea, acales, or plot ayabola, 
are here defined to be *Liaited* L- 

C L D F I P 0 1.29 Resulta of coaputationa available iMBCd IfltClY 

(not In a subsequent run) aa input to other 
procedures (e.g. residuala treated aa data to 
other procedurea . ) 

C L D F I P 0 1.30 Reaulta of coaputatlon and Interaediate reaults 

(e.g. OR decoapoaitiona, correlation aatricea, etc.) 
output In aachine-readable fora (diak/tape/carda) . . 

---- Thank you ---- 
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Selected Questionnaire Replies 

reproduced (with permission) from 

Kohm, Robert F., Ryan, Thomas A. Jr., and 
Velleman, Paul F. (1977), "Index of Publicly 
Available Statistical Software," microfiche 
in Proceedings of the Statistical Computing 
Section, American Statistical Association . 
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' 1. Pile flanipulatiOD 

• 2. Variable labeliDg 

* 3. Transforflations 

- 4. Matrix Banlpulatloos 
. 5. Sorting A Batcbiog 

- 6. Case selection/eidusloD 

- 7. Consistency checking 

of data 

- 8. Auto error correction 

27. Tabic printing 

(sulti-wsy tables) 

28. Graphics (versatile) 

29. Reuse of coaputationo 

in saBe run 

30. Storage of ooBputationa 

31. Run batch 

32. Run interactively 

33. Statistical level 
3^1. CoBputlng level 
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